Improved performance of QCD code on ALiCE

Author

  • Z. Sroczynski
Abstract

In a typical lattice QCD project the total runtime of code on a supercomputing platform is often measured in months or even years. This means that even a modest improvement in the performance of the code can yield very tangible benefits. There are two aspects to the optimisation of code for parallel machines: single-node optimisation and the minimisation of the overhead incurred by inter-node communications. The former requires that the code be written to take full advantage of the high performance available from today's advanced hardware. The latter is of particular importance on cluster machines, like ALiCE, where the scalability of code can be a serious problem.





Publication date: 2002